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Numerical Simulations of an SU(2)l ® 5^(2)#-symmetric Higgs- Yukawa 
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We report on our work on the SU(2)l ® SU(2)r symmetric Higgs Yukawa Model with mirror fermion action. 
Our model describes a fermion Higgs system in the limit of vanishing gauge coupling. Setting the bare Yukawa 
coupling of the mirror fermions G x to zero, we want to determine the triviality bounds on the renormalized 
Yukawa coupling of the fermions Gr^ and the scalar self-coupling (/a on 8 3 x 16 and 16 3 x 32 lattices. 



1. Introduction 

Fermion Higgs models with left right symme- 
tric action have been intensively studied over the 
last years (see [1] and references therein). The re- 
sults obtained so far are in good agreement with 
one loop perturbation theory but there are still 
strong finite size effects, especially at large values 
of the Yukawa coupling. 

With increasing computer power computations 
on larger lattice sizes become feasible and we can 
get a better control over the finite size effects (be- 
side the use of improved actions). The use of mas- 
sivly parallel computers boosts this development. 
The QUADRICS Q16 is one of these which has a 
very appealing price/performance ratio. 

In the following we report on our progress in 
implementing the model which has been studied 
in [1] on the Q16. We present first timing results 
of the program with dynamical fermions and de- 
tails of the implementation. 

2. The Model 

The lattice action of the model consists of a 
pure scalar part and a mixed fermion scalar part 
S = Stp + Sy . The scalar field <p is represented as a 
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2 Cg) 2 matrix <p = <f>4 + i<Jk4>k k = 1,2,3, where 
are the Pauli matrices and <f> a are real variables, 
'f is the fermion pair of fermion ip and mirror fer- 
mion x doublets, 'f = (ip,x)- Using conventional 
normalizations we have 



-Tr(<p+<p x ) - 1 



X 

4 

-K ^ ^Px + ^l^x+Xx + ^l^Xx 
M = -4 

+ r (~Xx+ji^ - Xx^ + ^x+^Xx - i> x Xx) ] 

+ G> [^ Rx ft^Lx + i) Lx fxi>Rx\ 
+ G X [XRxVxXLx +XLxVtXRx] } 

For the Hybrid Monte Carlo the number of fla- 
vours has to be doubled so that the last sum be- 
comes a sum over the lattice points and the two 
flavours. 

The choice of the seven parameters follows [1]. 
The Wilson parameter r is set to 1. In order 
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Thermalization and tuning of k 
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Measurement with equal k on all boards 
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Figure 1. Example run on the Q16 



to study the triviality bound, we set the quartic 
coupling A to infinity. We choose \i^, x = 0.0 and 
G x = 0.0 to ensure the decoupling of the mirror 
fermions in the continuum limit. The fermionic 
hopping parameter K is fixed to K cr = 0.125 [1]. 
This leaves the fermionic Yukawa coupling 
and the hopping parameter k as physical input 
parameters. We use k to tune the scalar mass and 
select the phase. 



3. Implementation on the QUADRICS 

To simulate the model we use the hybrid Monte 
Carlo algorithm with conjugate gradient for the 
matrix inversion. For information on the parame- 
ter dependence we refer to [1],[4] and [5]. 

The topology of the Q16 is 2 x 2 x 32 which is 
not well suited for a lattice of size 8 3 x 16. But it 
is possible to run the Q16 as 16 independend Ql 
boards with 2x2x2 topology and periodic boun- 
dary conditions closing on each board. Therefore 
we decided to implement the program in a sort 
of two step parallelism. First we did a geometric 
parallelization on the Ql boards in dividing the 



lattice in 8 parts and putting each one on of the 
8 CPUs. The second step is simply to run a dif- 
ferent configuration on each of the 16 Ql boards. 
The drawback of this approach seems to be that 
one has to equilibrate each configuration and so 
waste a lot of CPU time. Fortunately, this is not 
true in our case. Before we can start our mea- 
surements, we have to tune the scalar hopping 
parameter k in order to achieve a scalar mass of 
0.6 — 0.8 in lattice units and to choose the desired 
phase (here the broken, FM, phase, see [1]). Fig. 
1 shows what we do. We start on each Ql with 
a different value of k, thermalize these configura- 
tions and then determine the scalar mass on the 
16 boards. What we get is shown in Fig. 2. From 
this picture we determine the hopping parame- 
ter value for the measurements. We pick out the 
configuration which lies closest to this values, du- 
plicate it 16 times and restart the program now 
with 16 equal configurations and k values. If the 
new k value lies in between two old k values we 
do again a few thermalization sweeps, the number 
of which is of the order 10 smaller than on the 
original start configurations. At the end we get 
16 independend results from which determine the 
quantities of interest by the bootstrap method. 
Within each configuration the errors are estima- 
ted by a jackknife approach. 

The larger 16 3 x 32 lattice will be run on the 
DFG QH2 which will allow a more flexible parti- 
tioning, so that we can use the same method but 
with less configurations. 

A problem which has to be faced on the QUAD- 
RICS is its single precision floating point hardwa- 
re. There are three points in the program where 
global sums have to be calculated and where the 
low precision can cause problems. In the metro- 
polis update at the end of each trajectory small 
errors in the summation produce rather big effects 
on the update probability due to the exponential 
function involved. Errors in the scalar products, 
which have to be evaluated during the conjuga- 
te gradient iterations, influence the convergence 
in an unpredictable manner. And, last not least, 
in the measurement routine, where we expect the 
errors to play a less important role. We have de- 
cided to use in all these cases a summation proce- 
dure where the terms are added tree like, so that 
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Figure 2. k tuning at = 0.3 



at every step terms of equal size are added. 

For the random numbers we use the internal 
rand 16 library routine, so we have for each of the 
128 CPUs an independend generator. 

A QUADRICS feature which caused some pro- 
blems is the limitation on the size of the executa- 
bles. This limit lies at ps 2.6MB, a size which can 
be reached very quickly by using compile time for 
loops even if the length of these loops is small (in 
our program typically 2-4). 

4. Timing results 

The most time consuming part in the simula- 
tion is the conjugate gradient. It utilizes almost 
95% of the whole CPU time. We have compared 
the times for a single matrix inversion with times 
from a CRAY YMP. For a fixed number of itera- 
tions we are about 10% faster on the Ql. If we 
use the residual as the stopping criteria, which is 
of course more realistic, we are about 10% slower, 
which results from an increase in the number of 
iterations. We think this increase is due to sum- 
mation errors in the scalar product, but we will 
have to clarify this point a bit more, before going 



to larger lattices. Compared to the peak perfor- 
mance we achieved ps 45% in the conjugate gradi- 
ent. These values are in good coincidence with the 
results obtained with the performance analyzer. 

Lattices of size 4 3 x 8 can be run either on a 
single CPU or, geometrically parallelized, on a 
eight CPU Ql board. We have measured a speed 
up factor of 6 for one hybrid trajectory. 

Comparing the times for different lattice sizes 
on the Ql, an increase in the performance, measu- 
red in MFLOPS, with increasing lattice size can 
be seen. This is due to a better filling of the pi- 
peline. 

To sum up, we can say that the Q16 gives for 
our problem an increase in computer power of a 
factor 15 compared to a CRAY YMP. This allows 
us to achieve high statistics in a reasonable time 
on lattice sizes up to 16 3 x 32. The main difficul- 
ties are the limitation on the code size and the 
single precision arithmetic. 
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